arXiv:1507.08282vl [physics.soc-ph] 29 Jul 2015 


Common Knowledge on Networks 

Torrin M. Liddell* Simon DeDeo^ 
July 31, 2015 


Abstract 

Common knowledge of intentions is crucial to basic social tasks ranging from coop¬ 
erative hunting to oligopoly collusion, riots, revolutions, and the evolution of social 
norms and human culture. Yet little is known about how common knowledge leaves 
a trace on the dynamics of a social network. Here we show how an individual’s net¬ 
work properties—primarily local clustering and betweenness centrality—provide strong 
signals of the ability to successfully participate in common knowledge tasks. These sig¬ 
nals are distinct from those expected when practices are contagious, or when people use 
less-sophisticated heuristics that do not yield true coordination. This makes it possible 
to infer decision rules from observation. We also find that tasks that require com¬ 
mon knowledge can yield significant inequalities in success, in contrast to the relative 
equality that results when practices spread by contagion alone. 

In the game known as Rousseau’s Stag Hunt [l|, participants must decide between hunting 
a stag, or rabbits, before knowing what the others have chosen to do. Rabbits can be caught 
by individuals, but provide a lower reward than a stag. A stag can be caught, but only if 
others also decide to join the hunt, and a failed hunt is a serious setback. 

Even when an individual desires to join a hunt, she may not know that others know this. 
In the face of this uncertainty, she may go after rabbits instead and gain a meager, but 
relatively assured, beneht. Yet the other members of her group followed similar logic. All 
preferred to hunt stag, but all ended up hunting rabbits. This is the problem of common 
knowledge. 

The problem of the stag hunt is widespread in the modern world. An entrepreneur decides 
between a stable salary at a major corporation, or joining with others in a San Francisco 
startup. In determining whether potential team-mates are willing to forego their own back¬ 
up plans and commit to joint effort, he faces the same problem as Rousseau’s hunter. So, too, 
does an individual considering joining a demonstration against an unpopular regime, where 
small crowds may be violently dispersed, but safety, and political change, lies in numbers [^. 

Much work has been done on the mechanisms that can drive cooperation, but ensur¬ 
ing that all individuals involved have the capability to coordinate is an equally difficult 
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problem i- In the psychological sciences, this is often studied in the context of hidden 
prohle problems, where optimal decision-making requires the synthesis of both public and 
private information from multiple individuals. Successful completion requires the generation 
of common knowledge of the private information, a task which proves to be very difficult [^. 
However, under some circumstances, individuals are capable of surmounting this problem 
and building the proper common knowledge to reach the correct conclusion p, [^. 

Here, we study how this bounded logic plays out in real-world scenarios, with three 
increasingly sophisticated heuristics for solving the common knowledge problem in a social 
network. We provide a brief introduction to the theory of common knowledge. We then 
specify a series of three decision rules that interpolate between simple copying of others 
(contagion), and full common knowledge. We simulate these decision rules on realistic social 
networks. This enables us to derive a behavioral signature that indicates that individuals are 
making use of common knowledge in their decision-making process. We analyse both group- 
level outcomes, and the network properties of individuals that predict task participation. 

When common knowledge is required, we hnd that an individual’s participation in a 
task is predicted by high levels of clustering in their local neighborhood, and being in a 
central location in the graph. Importantly, this behavioral signature is not present in other 
potential network models for the adoption of behaviors. We thus provide a new diagnostic, 
at the network level, for tasks that require common knowledge. We also hnd that these more 
sophisticated rules lead to unequal outcomes: only some individuals are consistently able to 
join in risky cooperative tasks. 

1 The Theory of Common Knowledge 

At the heart of the stag hunt is a problem concerning the minds of others. If a group of three 
is needed, it is not enough for me to know that i and j are willing to join; I will want to 
know that i knows that j is willing—otherwise, I will worry that, presuming the absence of 
j, she will fail to appear. And, of course, if I want to know this, it is natural to assume that i 
will want parallel knowledge: that j knows that I am willing. And if I have reason to doubt 
that i knows that j knows that I am willing, again—given sufficiently low risk-tolerance—the 
hunt will fail. 

I may inform i that j knows that I am willing; but, of course, now I must inform j that i 
knows this, and return to i to inform her that j knows that—the regress, at least in a formal 
sense, is inhnite. What is needed is not only for all members of the group to have knowledge 
of the group’s willingness, but also for all members to know that all members know this, and 
also that all members know that all members know this, and so on. 

This is the problem of common knowledge. In order for a group to build common knowl¬ 
edge, all members of the group must communicate with one another, and this communication 
must be known to all members of the group. Formally, any game with multiple Nash equi¬ 
libria must solve the common knowledge problem [^, and Ref. provided a mathematical 
framework for describin g th e ideal case. But ordinary human coordination never achieves 
such an optimal state [9|, [lOl] . In the presence of imperfect knowledge and hnite understand¬ 
ing, we are forced to rely on heuristics such as those we present here. 

In the case above, for example, I might build common knowledge by collecting members 
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of my group together and having them declare their willingness, face-to-face, in public 0. 
Or, relying on bounded-rational decision-making, I might be content (in the three person 
case, with z, and j) to inform i and j about my willingness (and learn of theirs), relying on 
a link between i and j to convey their mutual willingness and mine to each other. 

I may not know (for exarnple) that i knows that j knows that I am willing, but this high 
a level of “level-fc thinking” [1^ is rarely observed to influence real-world behavior, even in 
high-stakes cases involving the potential for mid-air passenger jet collision [l^. In order to 
simulate this process we present, in the next section, a set of three increasingly sophisticated 
heuristics, or decision-rules, for how individuals might decide to participate in a cooperative 
task. 


2 Network Models of Common Knowledge 


The hrst rule is simple contagion, in which behaviors travel through the network only by ob¬ 
servation of behavior. Contagion is a basic mechanism for how behaviors and practices might 


be adopted on a network jl4j . Il5l | . In these models, cultural practices and task engagement 
amount to the spread of an infectious meme. 

The second rule is the egoist rule. When following this rule, individuals anticipate the 
behavior of others, but without simulating their meta-cognition. Individuals believe that the 
knowledge they possess is naturally shared by anyone they encounter. 

The hnal rule is that of common knowledge. Here, individuals make decisions in line with 
the reasoning presented above, simulating the meta-cognitive states of the other players and 
choosing to participate only when others are guaranteed to as well. This rule is realistically 
satishable using a heuristic that assumes that everyone else is an egoist. 

In the real world, success—the capture of the stag—is never guaranteed. Instead, indi¬ 
viduals must assess the relative rewards of the stag compared to the risk of a failed hunt 
due to insufficient numbers. We are thus concerned with how decision makers attempt to 
meet their own criteria for sufficient numbers using one of these three heuristics. In all three 
cases, as we shall see, it is possible for a player to decide to undertake the joint task, but to 
hnd herself without the desired number of participating neighbors. 


2.1 Common features of all rules 

All three rules have two free parameters. The hrst is the base threshold for behavioral 
adoption, 6. This baseline threshold regulates the personal threshold, 6i, for each node i. 
An individual’s 9i starts equal to the base threshold 9 but is distinct from it and can change 
over time. Both 9 and 9i are positive integers. 

The threshold 9i dictates the willingness of individual i to participate in the task. An 
individual node i desires that there be 9i likely-to-participate nodes in order to be willing 
to participate. When the node i is deciding whether to participate or not, the threshold of 
i {9i) is consulted. The node examines its neighbors, and if at least 9i — 1 are judged to be 
likely-to-participate, the node also participates. 

The threshold 9i has a number of interpretations that, while conceptually distinct, are 
equivalent for our purposes. In cases where the probability of a reward depends on the 
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number of participants, we can think of 6i as a measure of an individual’s risk tolerance. In 
cases where the reward is a step function in the number of participants, then 9i is an individ¬ 
ual’s (risk-adjusted) estimate of that threshold. Note that in situations where an objective 
outside criterion of a successful outcome {e.g., catching the stag in a stag hunt) exists, the 
participation of 9i individuals does not necessarily guarantee this objective criterion will be 
met. However, insofar as the individual threshold is reflective of the individual’s model of 
outcome probabilities as well as the utility associated with the outcomes, participating with 
9i total participants is still a success in decision procedure, if not in outcome. 

The notion of “likely-to-participate” is what varies across our three candidate models 
and forms the basis of the decision rule. As we shall see, individuals may be mistaken about 
others, and may hnd themselves participating in a group smaller than their desired threshold. 

The only other free parameter is the accidental participation rate a. All three models 
allow for a small chance to accidentally participate in a behavior. This means that even 
when an individual does not believe there are enough others that are likely to participate, 
there is a chance (equal to a) that the individual will participate anyway. Where relevant, 
the decision rules described below assume that every node knows only the local structure of 
the graph; a node is aware of who its neighbors communicate with. This rate is a simple way 
to model the effects of noisy communication and misunderstandings endemic to real-world 
systems. 

2.2 The contagion rule 

Contagion represents information transfer solely through the direct observation of behav¬ 
ior: “likely-to-participate” is dehned as “participated recently”. Each individual is willing to 
attempt a task if, and only if, they see enough of their neighbors participating in the previ¬ 
ous time step (barring “accidental” participation). Behaviors start when an early adopting 
individual decides they are willing to “join” a behavior despite no peers participating in 
the behavior (represented by the small accidental participation chance described above). In 
our earlier hunt example, this means that individuals will only go on a hunt if they saw 
a sufficient number of individuals go on a hunt yesterday. When this process is operating, 
individuals have very little motivation to participate in a behavior unless others have already 
done so. 

In more formal terms, the threshold of each node represents how many neighbors a 
node needs to see participating in a behavior before they adopt the behavior. Recall that 
nodes include themselves when comparing the number of likely participators to their own 
thresholds. For example, a node with a threshold of three needs to see two other individuals 



2.3 The egoist rule 

The egoist model represents individuals who perform a partial simulation of the decision 
making behavior of their neighbors. The simulation is only partial: no attempt to model the 
other player’s meta-cognition is made. 
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An egoist individual polls the willingness of his neighbors. If some group of size k are 
all willing to participate if at least k people total participate, and k is greater than then 
the egoist will participate. From a technical standpoint, the reasoning is insufficient: even 
though he knows that there is a willing group of sufficient size, he has not conhrmed that 
all members of the group have this information as well. The egoist assumes that everyone 
knows what he knows, and thereby justihes his name. Behaviors of this form are often seen 
in experiments la, llTI] and in some scenarios may be adaptive 
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Under such a rule, the egoist can end up in a scenario where he thinks he is certain that 
enough individuals will join him, but in fact he turns out to be the only one who has done so. 
To see how, recall our opening example, where h spoke to each potential hunter individually, 
and found that there was a sufficient number of hunters such that they would all be happy 
if they all attended. Under the egoist model, this standard of evidence is sufficient and H 
attends (and thus potentially attends an under-attended hunt, as described earlier). 

In formal terms, a node’s threshold represents the minimum size of the local group of 
willing individuals necessary for the node to participate. A given node i with threshold 6i 
will participate if and only if there is some subset of Fs neighbors (including i itself) G, such 
that G has size k > 6i and also that all group members g have thresholds 6g < k [i.e., there 
must be a group with a size at least equal to Fs threshold that all have thresholds less than 
or equal to the group size). 


2.4 The common knowledge rule 

This model is adapted from a model developed by Chwe j^, for the problem of common 
knowledge in collective action. Chwe primarily framed his model in terms of riot partic¬ 
ipation, but we extend this model to multiple classes of different cooperative phenomena, 
specihcally those where (unlike in Chwe’s model) opportunities for participation occur many 
times over the course of a given simulation. 

The common knowledge model describes scenarios where individuals not only attempt to 
predict the future behavior of other players, but do so in ways that model the knowledge of 
the other players. Under the common knowledge rule, an individual will only be considered 
likely-to-participate if they are part of a group of sufficient size for which it is common 
knowledge that all would be happy to participate if all did so together. 

In network terms, this means a node will only participate if the node is a member of a 
clique composed of k members, where all members of the clique have thresholds less than 
or equal to k. A clique is a group of nodes where all nodes have edges to all other nodes. 
We will refer to a clique of k individuals as a fc-clique. Intuitively, the clique requirement 
matches the concept of all individuals in a group being certain that all nodes know both the 
thresholds of all other nodes, as well as the knowledge state concerning the thresholds of all 
other nodes; that is, the thresholds of clique members are common knowledge among all the 
clique members. Formally, a node i will participate if and only if z is a member of a group G 
such that G is a fc-clique and any given group member g has a threshold 9g such that 9g < k. 

While this rule might appear to require an Aumann-like inhnite hierarchy [sj , it is equiv¬ 
alent to an individual modeling her neighbors as using the egoist rule and only participating 
when the number of certain-to-participate egoist neighbors is sufficient to satisfy her thresh¬ 
old. In the taxonomy of Ref. [1^, this is a form of level-one reasoning, with the zero level 
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Figure 1: A small example network. Capital letters indicate node names, 6 indicates each 
node’s threshold, and hlled circles indicate that the node adopted the behavior in the previous 
time-step. 


decision rule being egoist. 

2.5 Summary of rules 

We review the three decision rules by reference to Figure [H which presents a small ex¬ 
ample network of six nodes, labeled by their thresholds. Filled circles indicate nodes that 
participated in the previous time step. 

First, consider node A’s behavior under the contagion rule. A checks if its neighbors 
B, C, and D have participated in the previous time step. C and D have done so, and A’s 
threshold for participation is three; A will participate in the next round. Conversely, C has 
no neighbors who have recently participated, and also has a threshold of 3, and so will not 
participate in the next round (barring the accidental participation governed by rate a). 

Now consider the egoist decision rule. A consults the same set of neighbors, but now 
also considers their thresholds. A has three neighbors, all with threshold three. Including 
A itself, this makes four individuals willing to participate in a group of three or more, one 
more than necessary for A to participate, thus A, again, in this case, participates. 

Finally, consider A utilizing the common knowledge decision rule. The same set of 
neighbors are relevant, but now the connections between neighboring nodes are also relevant. 
A knows there are four individuals willing to participate in a group of three or more, but 
it also knows that each of these nodes does not have this information, as they do not have 
edges to one another. Therefore A will not participate. Conversely, node F knows there is a 
group of three nodes including itself willing to participate in a group of three or more, and 
that this fact is common knowledge among this group. This is because B, E, and F form a 
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clique of nodes, all whom communicate with one another. F will, under this decision rule, 
participate. 

2.6 Two Scenarios 

The three rules described above neatly partition the space of decision procedures for the 
adoption of a behavior or participating in a task. In this section, we place these rules in real- 
world contexts to see what effects they have on individuals and on the large-scale outcomes 
of the societies they belong to. 

We consider two distinct scenarios: one, “the hunting party”, closely mimics the original 
stag hunt problem. The other, “adopted fad”, considers cases where individuals might wish 
to adopt a new cultural practice conditional on the participation of others. These scenarios 
differ only in what happens to node thresholds after the adoption of the behavior in question. 

In the hunting party case, individuals who participate in a hunt show a subsequent (tem¬ 
porary) disinclination to participate during subsequent time steps. Every time an individual 
attends a hunt, her threshold is increased by one. If an individual does not attend a hunt and 
her threshold is above the base threshold, 0, she then decreases her threshold by one. This 
process stops once the base threshold is reached. This means that participation is usually 
followed by a decrease in enthusiasm that eventually builds back up to the base threshold. It 
models an individual who has either sated herself from the gains of successful cooperation, 
or has become temporarily risk averse subsequent to a failure to reach quorum. 

In the case of the adopted fad, adoption of the practice leads to a temporary increase 
in the enthusiasm to continue. When an individual participates in a fad, her threshold 
immediately drops to the minimum of one. Following this initial drop, her threshold increases 
by one for each additional round of participation. After an individual stops participation, her 
threshold decreases by one if it is above the base threshold, or otherwise remains the same. 
This means that participation is usually followed by further participation, but eventually 
this participation wanes. 

We consider two notions of success. The hrst is “participation success”: simple adoption 
of the behavior, whether or not the individual reaches quorum. Under this notion, the 
success condition is simply whether or not an outside observer would see an individual 
attempt to participate in the task at hand (hunting party), or adopt the behavior in question 
(adopted fad). We can also measure “quorum success”: the fraction of the time an individual 
both participates and meets her threshold 6i. For simplicity, we show here the results for 
participation success. As we show in the Appendix, all of the major conclusions we reach 
here also hold for quorum success. 

Both the hunting party and adopted fad scenarios share a few basic elements: the struc¬ 
ture of the network, the assignment of “thresholds”, and the rate of random adoption. We 
consider networks randomly generated using the Watts-Strogatz generation method [l^; for 
our simulations here, each network consists of 100 nodes, with an average of eighteen edges 
and a j3 (the probability that a local edge becomes a random edge) value of 0.1, implemented 
in the NetworkX python package j^. This generation method produces graph structures 
more realistically similar to the small-world structure commonly observed in naturally oc¬ 
curring networks jl^. All edges are bidirectional; in a cooperative endeavor it is natural to 
assume two-sided communication. 
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Scenario 

Decision Rule 

Gini Goefficient 

Hunting Party 

Gontagion 

0.055 


Egoist 

0.221 


Gommon Knowledge 

0.159 

Adopted Fad 

Gontagion 

0.029 


Egoist 

0.158 


Gommon Knowledge 

0.147 


Table 1: Inequalities of outcome: the egoist and common knowledge rules produce far more 
unequal outcomes (higher Gini coefficients) than the more “egalitarian” contagion process. 


For the same choice of 9 and a, the three decision rules lead to different overall rates of 
participation; we thus choose parameter pairs, rule by rule, that lead to the same average 
rate of participation. 


3 Results 

Given a decision rule, and a scenario, we can study two core topics of interest: (1) the 
distribution of success within the society as a whole, and (2) the predictors of individual 
success. 

Remarkably, while the hunting party and the adopted fad scenarios seem conceptually 
distinct, we find very similar results for both society-wide and individual-level outcomes. 
The main difference is in overall rates of success; because enthusiasm for the hunt declines 
immediately, it is difficult to get more than 50% participation among individuals. 

In both scenarios, we find that both common knowledge and egoist decision rules lead 
to high inequalities in outcome. Societies that rely upon these more cognitively-complex 
rules have some individuals who are able to achieve the task a large fraction of the time, 
while others find themselves rarely, if at all, able or willing to participate. Gontagion, by 
contrast, leads to highly equal rates among individuals; not only does the practice reach 
nearly everyone in the society, it does so at roughly equal rates. 

This is shown visually in Fig. [21 in Table [H we report the average Gini coefficients for 
these same simulated networks, which confirms these results; the egoist rule shows slightly 
higher levels of inequality compared to common knowledge and both are much higher than 
simple contagion. Interestingly, in Fig. [2] a third mode appears in the adopted fad / common 
knowledge case; these appear to be nodes on the boundary between two densely connected 
subgroups. These peripheral nodes can be triggered when both subgroups adopt the fad 
simultaneously. 

Rule-by-rule, differences in participation rate are driven, in large part, by the relative 
position of individuals in the network. To study this formally, we model each individual 
node’s participation as a linear function of three predictors: 

1. Degree. The number of neighbors the node has. 

2. Betweenness Gentrality. The proportion of shortest paths between all node pairs that 
pass through the node. 
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Figure 2: Distributions of individual participation rates for the hunting party scenario (top 
row) and fad adoption (bottom row). In both cases, egoist and common knowledge decision 
rules lead to highly unequal outcomes: some nodes have very high rates of participation 
compared to others, who participate rarely, if at all. Differences in overall averages between 
the two scenarios represent the greater difficulty of achieving high rates of participation in 
the hunt scenario, where enthusiasm is sated after a successful hunt. 
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3. Clustering. The proportion of the pairs of neighbors a node has that are themselves 
neighbors to each other. 

We use hierarchical multiple linear regression, with the parameters of the model estimated 
via Bayesian methods. The model is hierarchical in that we estimate regression parameters 
for each simulation, with the simulation-level parameters distributed in a higher-level normal 
distribution^ 

The data (including success and all three predictors) were standardized by 2 ;-score. This 
means that the weights are interpretable in terms of standard deviations: a weight of one on 
a predictor indicates that a one standard deviation increase in the predictor predicts a one 
standard deviation increase in success. By looking at the weights on each these properties, 
we can determine which properties matter, and how much. 

For the hunting party, degree is a relevant predictor for all decision rules; the higher 
the degree, the more likely a node is to participate in the task. Betweenness centrality 
is positively associated with success for the common knowledge rule, negatively related to 
success in the egoist rule, and unrelated to success in the contagion rule. Clustering is 
strongly related to success in the common knowledge rule, and is not meaningfully related 
to success in either of the other two rules. For the adopted fad model, the contagion model 
is characterized by no credibly non-zero weights, the egoist model is characterized by a 
strong positive weight on degree, a negative weight on betweenness centrality, and near¬ 
zero weight on clustering, and the common knowledge model is characterized by a positive 
weight on degree, a positive weight on betweenness centrality, and a strong positive weight on 
clustering. The results of this analysis are shown in Fig. [21 details on the Bayesian posterior 
probabilities are presented in the Appendix. 

Despite the structural differences between the hunting party and adopted fad scenarios, 
a strong weight on clustering remains the primary, and unique, signature of a common 
knowledge process. To a lesser extent, betweenness centrality is also a unique predictor of 
common knowledge tasks, again independent of scenario]! 


4 Discussion 


Common knowledge may seem like an egalitarian decision rule: to build it requires mutual 
communication among all members, and when the knowledge is present, all may join in. But 
this is only a local equality: those who successfully build common knowledge together may 
be equals, but they have vastly greater ability to succeed compared to those who cannot. A 
natural distinction between local equality and global inequality emerges from these tightly 
connected subgroups. 


^For more details on Bayesian multiple regression, see 
to Bayesian data analysis see Ref. 


Chapter 18. For a more general introduction 


23|. 


^In the egoist model, our results indicate that higher betweenness centrality is predictive of lower levels of 
success in scenarios similar to the fad case. This suggests that when individuals do not utilize meta-cognitive 
information about others in deciding to cooperate, being centrally located is worse than being peripheral, 
holding degree and clustering constant. We suspect that some nodes with high betweenness in this condition 
are often actually placed between two or more locally connected subgraphs. The spread of participation 
tends to remain in these smaller subgraphs, meaning that high betweenness individuals have less exposure 
to this spreading, and so are less likely to participate in a fad. 
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Inequality in human societies has often been linked to multiplicative gains, such as those 
from return on capital [^, 25|, made by a subset of the population. Our results here suggest 
that network position may play a distinct role in the creation of inequality. Studying network 
properties should allow us to make a further distinction between sources of advantage: simple 
access to others (egoist model), or membership within tightly-knit communities (common 
knowledge). 

Inequality also persists across generations, and success appears to be inherited by a 
number of distinct mechanisms, including both direct wealth transfer and schooling [^. If, 
as seems likely, social network position is inherited, our results provide another mechanism to 
explain intergenerational correlations in wealth and status. If I inherit my network position 
from a previous generation, I will inherit the returns of risky cooperative action. 

These results contrast with contagion processes, where those with advantageous positions 
in the society have less ability to make use of them. Behaviors that spread via contagion 
on average tend to reach each node with reasonable frequency, with node properties playing 
only a minor role in overall success. 

Local clustering is an important predictor of success in the common knowledge model. 
It is not just that—as one expects—tightly knit communities enable common knowledge. It 
is also that related heuristics are insensitive to these same properties. Egotists and copiers 
do not beneht from being in highly-clustered subgroups. 

This relationship can potentially be used to detect the presence of common knowl¬ 
edge tasks. Potential sources of networks include social networking websites, academic co¬ 
authorship, records of e-mail or cellphone communication, and sharing of employees between 
divisions or of residents between cities and nations. Potential tasks of interest may range 
from literal hunting or decentralized military action to interdisciplinary collaboration, the 
construction of large-scale open source projects or knowledge commons, and problems of 
political or social change. The analysis framework presented here allows the detection of 
common knowledge decision rules in any task that occurs on a network. 

The social networks of indigenous populations may provide a particularly clear test of the 
results presented here. The “rational ritual” HH theory provides an explicit account of the 
connection between group rituals and the desire to generate common knowledge. At the same 
time, we know from fieldwork that rituals of this form do not necessarily include all members 
of the society. Only some people get invited to certain parties, rituals, or gatherings. 

In the case of the chicha (beer-drinking) parties among the Tsimane’, for example, invi¬ 
tations are both selective and strongly reciprocal, over and above kin effects j^. This leads 
to an incompletely-connected graph with high local clustering. In societies such as these 
observation of differential success, as well as more detailed information on correlated timing, 
could conhrm a behavioral process reliant on common knowledge heuristics. 

Although we do not consider it here, cognitive load matters. Even in the small village 
population of Ref. 27|], it may be difficult for individuals to track the risk-tolerance of all 


their neighbors. As societies become larger, and the pool of potential cooperators widens, the 
problem only becomes more acute. The cost of tracking my neighbors, their thresholds, and 
their connections to each other may lead to errors of judgment. In response to load, individ¬ 
uals may adopt new heuristics not considered here. Individuals may, for example, consider 
subsets of their neighbors, chosen by an egoist-like rule, and save effort by considering only 
the links within this subgroup. 
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Our three rules—contagion, egoist, and common knowledge—interpolate between differ¬ 
ent levels of cognitive sophistication. Recent work in cultural learning has led to a wide 
taxonomy of simple heuristics that fall between these three anchor-points, including the 
copying the decisions of the majority, of those who have previously been successful, or of 
a random individual 28|. The need to model the minds of others in common knowledge 


decision-making suggests that network properties should still be sufficient to separate these 
processes from rules that observe individuals, but do not model their internal states. 

Our results here have used the language of risky tasks and fad adoption. Common 
knowledge is more than this, however. Iniplicated in everything from wage negotiation to 
land reform, it is a source of both stability [i^, and change in a society’s institutions. 
In periods of upheaval, when conventions are rethought or rejected, common knowledge is 
damaged and individuals lose the ability to rely on their knowledge of what their neighbors 
will believe and accept. A historical example is provided by the chaotic state of the financial 
system during the French revolution at the end of the 18th Century. The emergence of 
multiple forms of credit-notes meant that individuals were forced to decide what forms of 
payment to accept based on what they expected others would later accept from them 
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Common knowledge requires that people communicate not just facts, but mutual knowl¬ 
edge of their states of mind. The difficulty of establishing common knowledge thus extends 
to electronic networks, even though communication costs there are low. The non-state cur¬ 
rency BitCoin has drawn the attention of banking institutions, but it is only secondarily 
a source of value. Its proof-of-work protocol is a mechanism of achieving massively decen¬ 
tralized common knowledge about the contents of a ledger [^. In the case of Wikipedia, 
the need for common knowledge of norms leads to elaborate systems of norm discussion and 
referencing and, even more literally, pages such as “Assume the assumption of good 
faith’l that explicitly prescribe beliefs users should hold about the beliefs of others. The 
era of electronic communication has only diversified the nature of the common knowledge 
problem. 


5 Conclusion 

The need for common knowledge extends well beyond the problem of cooperative hunting, 
to include a vast range of endeavors, from joining a risky start-up to protesting an unjust 
regime. For cooperation in risky or complex tasks to be successful, participants must not 
only be motivated to undertake the task, but must also have the capability to successfully 
coordinate their efforts. 

Indeed, much of human culture is not simply what we do, but what we do given the 
mutual expectations in our social worlds. Common knowledge problems lie beyond simple 
contagion, and are some of the clearest examples of how culture is more than just the spread 
of memes adopted through observation alone. 

Our results show that decision rules sensitive to the need for common knowledge leave 
clear traces on the behavior of individuals in a social network. Our results also show that 
common knowledge provides a new mechanism for the generation of inequality, over and 
above models that rely upon multiplicative accumulation of wealth. 

•^See https://en.Wikipedia.org/wiki/Wikipedia:Assume_the_assmnption_of_good_faith 
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Appendix 

In this Appendix, we present details of our simulation, and the parameter choices and jus- 
tihcations that form the basis of our results above. 

We also present data on “quorum success”, a measure related to “participation success” 
used to quantify the results presented in the main text. As noted there, and as we show in 
detail here, the outcomes associated with quorum success are consistent with participation 
success. 


A Simulation procedure 

For participation success, our outcome of interest is participation in a behavior. In the 
following simulations, we operationalize success as the number of timesteps the node partic¬ 
ipates in the behavior over the period of the simulation. A single simulation begins with a 
randomly generated graph, generated as described in Methods, Sec. 12.61 above. Following 
this, each of the three models is run on this graph structure for 200 rounds of potential 
cooperation. 

We model the decreasing enthusiasm described above as follows: every time an individual 
attends a hunt, their threshold is increased by one. If an individual does not attend a hunt 
and their threshold is above the base threshold, she then decreases her threshold by one. 
This process stops once the base threshold is reached. 

For the hunting party case, we choose parameters such that, for each decision rule, the 
success rates have means close to 80: 9 = 7 and a = 0.01 for the common knowledge model, 
0 = 6 and a = 0.01 for the egoist model, and 0 = 5 and a = 0.05 for the contagion model. 
For the adopted fad case, where participation is easier, we choose parameters such that the 
success rates have means close to 150: 0 = 8 and a = 0.01 for the common knowledge model, 
0 = 18 and a = 0.01 for the egoist model and 0 = 4 and a = 0.05 for the contagion model. 

B Details on Inequality Calculations 

B.l Hunting Party 

At each investigated parameter set, we generated 20 graphs using the generation method 
described above, and applied all three models to each graph. We hxed the parameters of 
each model such that each model had an average success rate of approximately 80 rounds 
participating out of the 200. For the common knowledge model this meant 0 = 7 and 
a = 0.01, for the egoist model this meant 0 = 6 and a = 0.01, and for the contagion model 
this meant 0 = 5 and a = 0.05. The number of rounds spent participating was recorded 
for each node. Then the distribution of success across this sample of networks was plotted 
as a histogram, as shown in Figure [H We found that the distribution of success was more 
unequal for the egoist and common knowledge models, with clusters of high success and low 
success individuals, whereas the contagion model demonstrated comparative equality, with 
a single unimodal cluster. To quantitatively assess the inequality encouraged by the two 
models, we computed the Gini coefficient |2l| of the distribution of success for each model. 
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Figure 3: Posterior distributions on the regression weights for each of the three models in the 
hunting party scenario (left column) and the adopted fad scenario (right column). Clustering 
and (to a lesser extent) betweenness centrality are strong and unique predictors of task 
participation in the common knowledge case, but not in the less-sophisticated contagion and 
egoist models. 
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The Gini coefficient is a commonly used measure of inequality that measures the 

deviation of a distribution (in our case, the distribution of success) from perfect equality, on 
a scale from 0 to 1. A Gini coefficient of 0 indicates perfect equality, with all individuals 
having equal values of success, and a Gini coefficient of 1 indicates total inequality, with a 
single individual having high levels of success and all others have none. The Gini coefficient 
for the common knowledge distribution is 0.159, the coefficient for the egoist distribution is 
0.221, and the Gini coefficient for the contagion model is 0.055. We also computed the Gini 
coefficients for each of the 20 generated networks individually, instead of collapsed across all 
the generated networks, as presented here. For all networks, the Gini coefficient is smaller for 
the contagion model than for either the common knowledge model or the egoist model. This 
supports the qualitative judgment that inequality is encouraged by the common knowledge 
and egoist models, and not by the contagion model. 


B.2 Adopted Fad 

We again investigated the distribution of success across nodes in each of the three models. 
In this scenario, we chose parameter values such that each model had an average success 
rate of approximately 150 rounds participating out of 200. Participation lasts for multiple 
rounds at a time in this scenario, thus 150 rounds out of 200 is comparable to the 80 rounds 
of success utilized in the hunt scenario. For the common knowledge model, we set 0 = 8 and 
a = 0.01, for the egoist model we set 0 = 18 and a = 0.01, and for the contagion model 
we set 0 = 4 and a = 0.05. Visual inspection indicated a pattern similar to the previous 
scenario: under these parameter values the egoist and common knowledge models produce 
unequal and multimodal distributions of success, whereas the contagion model produces 
relative equality and a unimodal distribution. Analysis of the Gini coefficient conhrms this: 
the common knowledge distribution has a Gini coefficient of 0.147, the egoist distributions 
has a Gini coefficient of 0.158, and the contagion distribution has a Gini coefficient of 0.029. 

We again computed the Gini coefficient for each network individually, and again found 
that the contagion Gini coefficient never exceeded the Gini coefficient for either of the other 
two models. This again quantitatively confirms that the common knowledge and egoist 
models demonstrate higher levels of inequality than the contagion model. 


C Details on Node Property Calculations 

C.l Hunting Party 

We generated 50 random graphs. Then, for each graph, we applied all three models and 
recorded information about each node’s success, as well as a set of node properties (described 
below). We used the same parameter values described in the previous section for the three 
models. However, it should be noted that the results reported here are robust to changing 
parameter settings: in all parameter settings we tested where meaningful regression was 
not ruled out (graphs with no variation in number of successful hunts) the same qualitative 
pattern of regression weights was observed. 
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To analyze the relationship between success and node properties we used a hierarchical 
multiple linear regression, with the parameters of the model estimated via Bayesian methods. 

The weight on degree is credibly non-zero and positive for the common knowledge (Mode 
= 0.372, 95% HDI from 0.339 to 0.407), egoist (Mode = 0.541, 95% HDI from 0.508 to 
0.573), and contagion (Mode = 0.315, 95% HDI from 0.295 to 0.334) models. 

To assess whether a parameter value is credibly non-zero, we compare the 95% highest 
density interval (HDI) of the posterior distribution to the null value. The 95% HDI is 
analogous to a conhdence interval, but represents the bulk of the distribution such that no 
parameter value outside the HDI has greater credibility than parameter values inside the 
HDI. The null value is represented by a region of practical equivalence (ROPE) around the 
null value. For all comparisons presented here, we used a ROPE of -0.1 to 0.1 on the scale of 
the standardized regression coefficients. If the HDI falls outside this range, we conclude that 
the parameter is credibly non-zero. If the HDI falls entirely with this range, we conclude 
that the parameter value is practically equivalent to zero. 

The effect of betweenness centrality differs slightly across the three models: it is a 
marginally positive predictor of success in the common knowledge model (Mode = 0.156, 
95% HDI from 0.098 to 0.208), a marginally negative predictor in the egoist model (Mode 
= -0.119, 95% HDI from -0.176 to -0.067), and is practically equivalent to zero in the con¬ 
tagion model (Mode = 0.012, 95% HDI from -0.022 to 0.039). In the case of clustering, 
the differences across the model are stark. The weight is credibly positive in the common 
knowledge model (Mode = 0.592, 95% HDI from 0.542 to 0.643), positive but not credibly 
non-zero in the egoist model (Mode = 0.069, 95% HDI from 0.021 to 0.115) and practically 
equivalent to zero in the contagion model (Mode = 0.024, 95% HDI from -0.008 to 0.051). 
The strong weight on clustering is unique to the common knowledge model. This is the 
primary signature that common knowledge is operating: a common knowledge decision rule 
predicts that those with densely connected local neighborhoods will be the most successful. 

C.2 Adopted Fad 

In this scenario, the weight on degree is credibly non-zero for the common knowledge (Mode 
= 0.275, 95% HDI from 0.243 to 0.312) and egoist (Mode = 0.900, 95% HDI from 0.874 to 
0.925) models, but is practically equivalent to zero for the contagion model (Mode = -0.003, 
95% HDI from -0.041 to 0.038). Moreover, the effect of degree in the egoist model is much 
greater than the weight in the other two models. The effect of betweenness centrality differs 
across the three models: it is a credibly positive predictor of success in the common knowledge 
model (Mode = 0.346, 95% HDI from 0.295 to 0.401), a credibly negative predictor in the 
egoist model (Mode = -0.212, 95% HDI from -0.246 to -0.173), and is marginally positive in 
the contagion model (Mode = 0.126, 95% HDI from 0.064 to 0.188). The case of clustering 
is very similar to the hunt scenario. The weight in contagion (Mode = 0.097, 95% HDI from 
0.038 to 0.165) is marginally positive, the weight in the egoist model (Mode = -0.042, 95% 
HDI from -0.074 to -0.007) is practically equivalent to zero, and the weight in the common 
knowledge is credibly positive (Mode = 0.745, 95% HDI from 0.688 to 0.795). This again 
conhrms that a major signature that common knowledge is operating is that those with 
densely connected local neighborhoods will be most successful. 
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Scenario 

Decision Rule 

Gini Goefficient 

Hunting Party 

Gontagion 

0.047 


Egoist 

0.420 


Gommon Knowledge 

0.163 

Adopted Fad 

Gontagion 

0.028 


Egoist 

0.145 


Gommon Knowledge 

0.144 


Table 2: Inequalities of outcome: In the alternative success definition, the egoist and common 
knowledge still rules produce far more unequal outcomes (higher Gini coefficients) than the 
more “egalitarian” contagion process. As in the histograms, the egoist decision rule in the 
hunting party scenario is the only major area of difference: due to the lack of the large 
group of high performers, the level of inequality is higher under this success rule than the 
participation success rule. 

D Quorum Success 

In the analyses presented so far, our operationalization of success was participation. Another 
potential operationalization of success is a combination of participation and the presence 
of enough participating neighbors to meet the node’s threshold. We call this definition 
“quorum success”. Under the quorum definition, participation is determined by the decision 
rules described above for each model, and following this participation decision, the number 
of participating neighbors is counted and the node is successful only if this count meets 
the node’s threshold. Non-attempting rounds are not counted as successes regardless of the 
number of participating neighbors, just as before. 

To ensure that the results presented here were generalizable, we implemented this rule 
and repeated the analyses presented above on the success behavior that resulted. 

D.l Inequality 

Our analysis here uses the same procedure described previously, with the same parameter 
values for each of the three models. Figure H] shows the distribution of success under the 
quorum rule, and Table [2] reports the Gini coefficients of the success distributions for each of 
the models in both scenarios. The results are very similar to the original success definition 
with one exception: the egoist model in the hunting party scenario. In this scenario, many 
times an egoist node will see neighbors that seem to them to be willing to participate, but in 
fact do not have sufficient neighbors themselves to participate. This means that participation 
often occurs in scenarios where not enough neighbors participate to meet a node’s threshold. 
The result of this is that many nodes that were highly successful under the participation 
success rule are now unsuccessful in the quorum success rule. This is reflected in the lack 
of a second high mode in the distribution of success, and a very high Gini coefficient, as a 
much smaller proportion of the individuals have high success. 
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Figure 4: Distributions of individual participation rates for the hunting party scenario (top 
row) and fad adoption (bottom row) using the operationalization of success described in 
this section. Notice that the patterns are very similar to Figure [2] with the exception of the 
Egoist distribution in the hunting party scenario. Success in this circumstance no longer has 
a large group of individuals who all have high levels of success. This is because the egoist 
decision rule in the hunt scenario often encourages participation when a node’s neighbors 
are not guaranteed to participate. 
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D.2 Node Properties 

We also analyzed the properties that make a node successful using the quorum dehnition of 
success, again using parameters and analysis methods identical to those previously described. 
Figured shows the posterior distributions on the regression weights in both scenarios using 
the quorum success rule. As in the inequality analysis, the results in the fad scenario are 
largely identical to the participation success rule; all credibly non-zero effects remain credibly 
non-zero, and no new credibly non-zero effects are observed. The results in the hunt case also 
closely mirror the inequality results: the egoist model is unique in that it has importantly 
different results when using the quorum success rule than when using the participation 
success rule. The egoist model has a credibly negative effect of degree (where before it 
was positive) and a practically equivalent to zero effect of betweenness (where before it was 
marginally negative). This negative effect of degree is at first puzzling: despite degree being 
a strong positive predictor of participation (and participation being a necessary condition for 
success) it is actually a negative predictor of success. We believe this effect results from those 
with high degree being at higher risk of becoming “out of sync” with their neighbors. Recall 
that in the hunt scenario, individuals are less likely to hunt shortly after they have already 
hunted. If a node has a small but sufficient number of neighbors that tend to successfully 
hunt together, adding additional neighbors could cause the node to participate as a result of 
these new neighbors when the smaller (but successful) hunting group is still in an unwilling 
to participate state. If this larger group is not synced, this will lead to failure. Following this, 
the node will itself be unwilling to participate, and the small successful group will attend 
without it. 

D.2.1 Quantitative Details: Hunt Scenario 

Here we present quorum success results for the hunt scenario.. The weight on degree is 
credibly non-zero and positive for the common knowledge (Mode = 0.377, 95% HDI from 
0.345 to 0.415), credibly negative for the egoist model (Mode = -0.144, 95% HDI from -0.177 
to -0.105), and credibly positive for contagion (Mode = 0.208, 95% HDI from 0.196 to 0.220) 
models. The effect of betweenness centrality differs slightly across the three models: it is 
a marginally positive predictor of success in the common knowledge model (Mode = 0.090, 
95% HDI from 0.040 to 0.147), and a practically equivalent to zero effect in the egoist model 
(Mode = 0.042, 95% HDI from -0.020 to 0.099) and the contagion model (Mode = 0.023, 
95% HDI from 0.010 to 0.039). As before, the case of clustering has large differences across 
the models. The weight is credibly positive in the common knowledge model (Mode = 0.544, 
95% HDI from 0.494 to 0.593), positive but not credibly non-zero in the egoist model (Mode 
= 0.124, 95% HDI from 0.068 to 0.172) and practically equivalent to zero in the contagion 
model (Mode = 0.038, 95% HDI from 0.024 to 0.053). Once again, the strong weight on 
clustering is unique to the common knowledge model. 

D.2.2 Quantitative Details: Adopted Fad 

Here we present quorum success results for the adopted fad scenario. In this case, the weight 
on degree is credibly non-zero for the common knowledge (Mode = 0.278, 95% HDI from 
0.241 to 0.311) and egoist (Mode = 0.783, 95% HDI from 0.754 to 0.811) models, but is 
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practically equivalent to zero for the contagion model (Mode = -0.004, 95% HDI from -0.042 
to 0.034). The effect of betweenness centrality again differs across the three models: it is a 
credibly positive predictor of success in the common knowledge model (Mode = 0.353, 95% 
HDI from 0.298 to 0.410), a credibly negative predictor in the egoist model (Mode = -0.182, 
95% HDI from -0.227 to -0.137), and is very slightly positive in the contagion model (Mode 
= 0.044, 95% HDI from -0.017 to 0.106). Finally, for clustering the weight in the common 
knowledge model is credibly positive (Mode = 0.742, 95% HDI from 0.684 to 0.802), the 
weight in the egoist model (Mode = 0.015, 95% HDI from -0.021 to 0.062) is practically 
equivalent to zero, and the weight in contagion (Mode = 0.001, 95% HDI from -0.049 to 
0.062) is practically equivalent to zero. This again confirms that a major signature that 
common knowledge is operating is that those with densely connected local neighborhoods 
will be most successfnl. 
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Figure 5: Posterior distributions on the regression weights for each of the three models in 
the hunting party scenario (left column) and the adopted fad scenario (right column) using 
the quorum success rule. The adopted fad results are nearly the same as the results using 
the participation success rule. The results from the hunt scenario are very similar to the 
results using the participation success rule, with the exception of the weight on degree in 
the egoist model, which is now negative. 
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